Search CORE

248 research outputs found

Speech-in-noise perception is linked to rhythm production skills in adult percussionists and non-musicians

Author: Abercrombie D.
Brown L.
Cross I.
Dauer R. M.
Iversen J. R.
Lehiste I.
Liberman M.
Patel A. D.
Publication venue: 'Informa UK Limited'
Publication date: 12/12/2017
Field of study

Speech rhythms guide perception, especially in noise. We recently revealed that percussionists outperform non-musicians in speech-in-noise perception, with better speech-in-noise perception associated with better rhythm discrimination across a range of rhythmic expertise. Here, we consider rhythm production skills, specifically drumming to a beat (metronome or music) and to sequences (metrical or jittered patterns), as well as speech-in-noise perception in adult percussionists and non-musicians. Given the absence of a regular beat in speech, we hypothesise that processing of sequences is more important for speech-in-noise perception than the ability to entrain to a regular beat. Consistent with our hypothesis, we find that the sequence-based drumming measures predict speech-in-noise perception, above and beyond hearing thresholds and IQ, whereas the beat-based measures do not. Outcomes suggest temporal patterns may help disambiguate speech under degraded listening conditions, extending theoretical considerations about speech rhythm to the everyday challenge of listening in noise

Crossref

Birkbeck Institutional Research Online

Speech rhythm: a metaphor?

Author: Abercrombie D
Barry WJ
Carter PM
Couper-Kuhlen E
Cummins F
Dankovičová J
Dellwo V
Dellwo V
Eriksson A
Fletcher J
Francis Nolan
Gibbon D
Grabe E
Hae-Sung Jeon
Kim J-M
Koreman J
Lehiste I
Lin H
Lloyd James A
Mok P
Nazzi T
O'Dell M
O'Rourke E
Patel AD
Pike KL
Platt JT
Ramus F
Ramus F
Shattuck-Hufnagel S
Tongue RK
White L
White L
Windmann A
Zvonik E
Publication venue: 'The Royal Society'
Publication date: 11/11/2014
Field of study

Is speech rhythmic? In the absence of evidence for a traditional view that languages strive to coordinate either syllables or stress-feet with regular time intervals, we consider the alternative that languages exhibit contrastive rhythm subsisting merely in the alternation of stronger and weaker elements. This is initially plausible, particularly for languages with a steep ‘prominence gradient’, i.e. a large disparity between stronger and weaker elements; but we point out that alternation is poorly achieved even by a ‘stress-timed’ language such as English, and, historically, languages have conspicuously failed to adopt simple phonological remedies that would ensure alternation. Languages seem more concerned to allow ‘syntagmatic contrast’ between successive units and to use durational effects to support linguistic functions than to facilitate rhythm. Furthermore, some languages (e.g. Tamil, Korean) lack the lexical prominence which would most straightforwardly underpin prominence alternation. We conclude that speech is not incontestibly rhythmic, and may even be antirhythmic. However, its linguistic structure and patterning allow the metaphorical extension of rhythm in varying degrees and in different ways depending on the language, and that it is this analogical process which allows speech to be matched to external rhythms

CLoK

Crossref

PubMed Central

Perceptual learning of time-compressed and natural fast speech

Author: Amitay S.
Browman C.
Cormier S. M.
Delhommeau K.
Dupoux E.
Esther Janse
Green K. P.
Haskell R. E.
Lehiste I.
Max L.
McClaskey C.
Miller J. L.
Patti Adank
Pavlovskaya M.
Pinheiro J.
Publication venue: 'Acoustical Society of America (ASA)'
Publication date: 01/01/2009
Field of study

Speakers vary their speech rate considerably during a conversation, and listeners are able to quickly adapt to these variations in speech rate. Adaptation to fast speech rates is usually measured using artificially time-compressed speech. This study examined adaptation to two types of fast speech: artificially time-compressed speech and natural fast speech. Listeners performed a speeded sentence verification task on three series of sentences: normal-speed sentences, time-compressed sentences, and natural fast sentences. Listeners were divided into two groups to evaluate the possibility of transfer of learning between the time-compressed and natural fast conditions. The first group verified the natural fast before the time-compressed sentences, while the second verified the time-compressed before the natural fast sentences. The results showed transfer of learning when the time-compressed sentences preceded the natural fast sentences, but not when natural fast sentences preceded the time-compressed sentences. The results are discussed in the framework of theories on perceptual learning. Second, listeners show adaptation to the natural fast sentences, but performance for this type of fast speech does not improve to the level of time-compressed sentences

Crossref

UCL Discovery

The University of Manchester - Institutional Repository

MPG.PuRe

On the Tail of the Scottish Vowel Length Rule in Glasgow

Author: Agutter A.
Aitken A. J.
Aitken A. J.
Beckman M. E.
Britain D.
Fromond R.
Gregg R. J.
Harrington J.
Harrington J.
Hayes B.
Hewlett N.
Jane H. Stuart-Smith
Johnston P.
Labov W.
Lass R.
Lehiste I.
Liberman M.
Llamas C.
Macaulay R.
Mendoza-Denton N.
Milroy L.
Munhall K.
Ohala J.
Scobbie J. M
Scobbie J. M.
Scobbie J. M.
Scobbie J. M.
Scobbie J. M.
Solé M. J.
Stuart-Smith J.
Tamara V. Rathcke
Tauberer J.
Trudgill P.
Watt D.
White L. S.
Publication venue: 'SAGE Publications'
Publication date: 01/12/2015
Field of study

One of the most famous sound features of Scottish English is the short/long timing alternation of /i u ai/vowels, which depends on the morpho-phonemic environment, and is known of as the Scottish Vowel Length Rule (SVLR). These alternations make the status of vowel quantity in Scottish English (quasi-)phonemic but are also susceptible to change, particularly in situations of intense sustained dialect contact with Anglo-English. Does the SVLR change in Glasgow where dialect contact at the community level is comparably low? The present study sets out to tackle this question, and tests two hypotheses involving (1) external influences due to dialect-contact and (2) internal, prosodically-induced factors of sound change. Durational analyses of /i u a/ were conducted on a corpus of spontaneous Glaswegian speech from the 1970s and 2000s, and four speaker groups were compared, two of middle-aged men, and two of adolescent boys. Our hypothesis that the development of the SVLR over time may be internally constrained and interact with prosody was largely confirmed. We observed weakening effects in its implementation which were localised in phrase-medial unaccented positions in all speaker groups, and in phrase-final positions in the speakers born after the Second World War. But unlike some other varieties of Scottish or Northern English which show weakening of the Rule under a prolonged contact with Anglo-English, dialect contact seems to be having less impact on the durational patterns in Glaswegian vernacular, probably because of the overall reduced potential for a regular, everyday contact in the West given the different demographies

KOPS - The Institutional Repository of the University of Konstanz

Crossref

Kent Academic Repository

Enlighten

Prosodic tools for language learning

Author: A. Batliner
D. Klatt
D. M. Chun
D. Pisoni
D. S. Hurley
E. Shriberg
F. Ramus
F. Ramus
I. Lehiste
J. Bernstein
J. D. Bowen
L. Loveday
M. Eskénazi
M. J. Luthy
N. Umeda
O. R. Kelm
P. M. Bertinetto
R. Delmonte
R. Delmonte
R. Delmonte
Rodolfo Delmonte
S. Hiller
W. Campbell
Y. Medan
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2010
Field of study

In this paper we will be concerned with the role played by prosody in language learning and by the speech technology already available as commercial product or as prototype, capable to cope with the task of helping language learner in improving their knowledge of a second language from the prosodic point of view. The paper has been divided into two separate sections: Section One, dealing with Rhythm and all related topics; Section Two dealing with Intonation. In the Introduction we will argue that the use of ASR (Automatic Speech Recognition) as Teaching Aid should be under-utilized and should be targeted to narrowly focussed spoken exercises, disallowing open-ended dialogues, in order to ensure consistency of evaluation. Eventually, we will support the conjoined use of ASR technology and prosodic tools to produce GOP useable for linguistically consistent and adequate feedback to the student. This will be illustrated by presenting State of the Art for both sections, with systems well documented in the scientific literature of the respective field. In order to discuss the scientific foundations of prosodic analysis we will present data related to English and Italian and make comparisons to clarify the issues at hand. In this context, we will also present the Prosodic Module of a courseware for computer-assisted foreign language learning called SLIM—an acronym for Multimedia Interactive Linguistic Software, developed at the University of Venice (Delmonte et al. in Convegno GFS-AIA, pp. 47–58, 1996a; Ed-Media 96, AACE, pp. 326–333, 1996b). The Prosodic Module has been created in order to deal with the problem of improving a student’s performance both in the perception and production of prosodic aspects of spoken language activities. It is composed of two different sets of Learning Activities, the first one dealing with phonetic and prosodic problems at word level and at syllable level; the second one dealing with prosodic aspects at phonological phrase and utterance suprasegmental level. The main goal of Prosodic Activities is to ensure consistent and pedagogically sound feedback to the student intending to improve his/her pronunciation in a foreign language

Crossref

Archivio istituzionale della ricerca - Università degli Studi di Venezia Ca' Foscari

It's not what you say but the way that you say it: an fMRI study of differential lexical and non-lexical prosodic pitch processing

Abstract Background This study aims to identify the neural substrate involved in prosodic pitch processing. Functional magnetic resonance imaging was used to test the premise that prosody pitch processing is primarily subserved by the right cortical hemisphere. Two experimental paradigms were used, firstly pairs of spoken sentences, where the only variation was a single internal phrase pitch change, and secondly, a matched condition utilizing pitch changes within analogous tone-sequence phrases. This removed the potential confounder of lexical evaluation. fMRI images were obtained using these paradigms. Results Activation was significantly greater within the right frontal and temporal cortices during the tone-sequence stimuli relative to the sentence stimuli. Conclusion This study showed that pitch changes, stripped of lexical information, are mainly processed by the right cerebral hemisphere, whilst the processing of analogous, matched, lexical pitch change is preferentially left sided. These findings, showing hemispherical differentiation of processing based on stimulus complexity, are in accord with a 'task dependent' hypothesis of pitch processing.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

King's Research Portal

Structure sharing—The case of free relatives in Serbian

Author: Bošković Z.
Bošković Z.
Bresnan J.
Chomsky N.
Chomsky N.
Citko B.
Dayal V.
Gračanin-Yuksek M.
Groos A.
Grosu A.
Halpern A.
Larson R.
Lehiste I.
Nataša Milićević
Progovac L.
Radanović-Kocić V.
Riemsdijk H.
Riemsdijk H.
Selkirk E. O.
Selkirk E. O.
Stjepanović S.
Vries M.
Ćavar D.
Publication venue: 'Akademiai Kiado Zrt.'
Publication date
Field of study

Crossref

Timing in talking: What is it used for, and how is it controlled?

Author: Alice Turk
Beckman ME
Beckman ME
Browman CP
Cambier-Langeveld T
Caspers J
Cho T
Chomsky N
Halle M
Hayes B
Heuven VJJP
Ivry R
Jackendoff R
Jurafsky D
Keating P
Keating P
Keating P
Lehiste I
Mo Y
Nam H
Nespor M
Nooteboom S
Ogden R
Pierrehumbert J
Rosenbaum DA
Rosenbaum DA
Saltzman E
Selkirk EO
Selkirk EO
Shattuck-Hufnagel S
Stefanie Shattuck-Hufnagel
Tanaka H
Turk A
Turk A
Vaissière J
Wing AM
Publication venue: 'The Royal Society'
Publication date: 19/12/2014
Field of study

In the first part of the paper, we summarize the linguistic factors that shape speech timing patterns, including the prosodic structures which govern them, and suggest that speech timing patterns are used to aid utterance recognition. In the spirit of optimal control theory, we propose that recognition requirements are balanced against requirements such as rate of speech and style, as well as movement costs, to yield (near-)optimal planned surface timing patterns; additional factors may influence the implementation of that plan. In the second part of the paper, we discuss theories of timing control in models of speech production and motor control. We present three types of evidence that support models of speech production that involve extrinsic timing. These include (i) increasing variability with increases in interval duration, (ii) evidence that speakers refer to and plan surface durations, and (iii) independent timing of movement onsets and offsets

Crossref

PubMed Central

Edinburgh Research Explorer